1.  Introduction


Let $X_{i1}, \ldots, X_{in_i}$, $i = 1, \ldots, k$, be $k$ independent random samples from populations with distribution functions $F_1, \ldots, F_k$ respectively. Suppose the null hypothesis is $H_0 : F_1 = \cdots = F_k$, the hypothesis of homogeneity. For various alternatives, test statistics of Kolmogorov-Smirnov (briefly, K-S) type have been suggested by several authors (see Birnbaum and Hall (1960), Conover (1965), (1967), Dwass (1960), Kiefer (1959), and Wallenstein (1980)). In this note we consider some K-S type statistics. For any set of constants $\{c_{ij}\}$, $i = 1, \ldots, m$ ($m$ being a positive integer), $j = 1, \ldots, k$, with $c_{i1} \neq 0$, we define two classes of K-S type statistics as follows:


(a)  Two-sided statistics:

$$U(k) = \max_i \sup_x \Bigl| \sum_{j=1}^{k} c_{ij} F_j^*(x) \Bigr| ,$$

(b)  One-sided statistics:

$$U^+(k) = \max_i \sup_x \Bigl( \sum_{j=1}^{k} c_{ij} F_j^*(x) \Bigr) ,$$

where $F_j^*$ is the empirical distribution function of the $j$th sample. As an example of the use of these statistics, consider the alternatives that one of the distribution functions, say $F_1$, is different from the rest or greater (less) than the rest, i.e.,

$$H_1 : F_1 \neq F_i , \quad i = 2, \ldots, k,$$

$$H_2 : F_1 > (<)\, F_i , \quad i = 2, \ldots, k.$$

Then one may use K-S type statistics $U_1(k) = \max_i \{\sup_x |F_1^*(x) - F_i^*(x)|\}$ and $U_1^+(k) = \max_i \{\sup_x (F_1^*(x) - F_i^*(x))\}$ ($U_1^-(k) = \max_i \{\sup_x (F_i^*(x) - F_1^*(x))\}$) for testing against $H_1$ and $H_2$ respectively. Observe that the first one is a special case of (a) whereas the second is a special case of (b). In both cases $c_{i1} \neq 0$.
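The special-case statistics just described compare the first empirical distribution function with each of the others. A minimal Python sketch of that computation follows; the helper names `ecdf` and `u1_statistics` are illustrative and not from the paper, and the sketch assumes that the supremum over $x$ can be taken over the pooled data points, since a difference of right-continuous step functions changes only there.

```python
from bisect import bisect_right


def ecdf(sample):
    """Return the empirical distribution function of `sample` as a callable."""
    xs = sorted(sample)
    n = len(xs)
    return lambda x: bisect_right(xs, x) / n


def u1_statistics(samples):
    """Return (two-sided, one-sided) maxima comparing sample 1 with each other sample.

    two-sided: max_i sup_x |F_1*(x) - F_i*(x)|
    one-sided: max_i sup_x (F_1*(x) - F_i*(x))
    """
    edfs = [ecdf(s) for s in samples]
    # The differences are step functions, so it is enough to look at data points.
    grid = sorted(x for s in samples for x in s)
    two_sided = max(max(abs(edfs[0](x) - f(x)) for x in grid) for f in edfs[1:])
    one_sided = max(max(edfs[0](x) - f(x) for x in grid) for f in edfs[1:])
    return two_sided, one_sided


if __name__ == "__main__":
    print(u1_statistics([[1, 4], [2, 3], [5, 6]]))
```

Both values depend on the data only through the ordering of the pooled observations, which is what makes the lattice-path representation used later possible.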


When $m = 1$, we get $\sup_x \bigl| \sum_{j=1}^{k} c_{1j} F_j^*(x) \bigr|$ from (a) and $\sup_x \bigl( \sum_{j=1}^{k} c_{1j} F_j^*(x) \bigr)$ from (b). Once we ascertain these distributions, which are discrete in nature, the distributions of $\sum_i \sup_x \bigl| \sum_{j=1}^{k} c_{ij} F_j^*(x) \bigr|$ and $\sum_i \sup_x \bigl( \sum_{j=1}^{k} c_{ij} F_j^*(x) \bigr)$ for finite sums can be determined in a routine way. Thus distributions of statistics $U_2(k) = \sum_i \sup_x |F_1^*(x) - F_i^*(x)|$ for testing against $H_1$ and $U_2^+(k) = \sum_i \sup_x (F_1^*(x) - F_i^*(x))$ ($U_2^-(k) = \sum_i \sup_x (F_i^*(x) - F_1^*(x))$) for testing against $H_2$ can be derived from a set of two-sample statistics $U_1(2)$, $U_1^+(2)$ ($U_1^-(2)$) respectively.

Let us consider testing the goodness-of-fit hypothesis that the samples are from a specified distribution with distribution function $F_0$. It is well known that if we apply the transform $X \to F_0(X)$ to the original data, the null hypothesis becomes $H_0 : F_1(x) = \cdots = F_k(x) = x$ for $x \in (0,1)$, i.e., every distribution is uniform over $(0,1)$. Analogous to (a) and (b), define the following K-S type statistics for $c_{i,k+1} \neq 0$:

(a*)  Two-sided statistics:

$$V(k) = \max_i \sup_x \Bigl| \sum_{j=1}^{k} c_{ij} F_j^*(x) + c_{i,k+1}\, x \Bigr| ,$$

(b*)  One-sided statistics:

$$V^+(k) = \max_i \sup_x \Bigl( \sum_{j=1}^{k} c_{ij} F_j^*(x) + c_{i,k+1}\, x \Bigr) .$$

In particular, statistics $V_1(k) = \max_i \{\sup_x |x - F_i^*(x)|\}$ and $V_1^+(k) = \max_i \{\sup_x (x - F_i^*(x))\}$ may be used when the alternatives are $H_1 : x \neq F_i(x)$, $i = 1, \ldots, k$, and $H_2 : x > F_i(x)$, $i = 1, \ldots, k$, respectively. Notice that we have set $c_{i,k+1} = 1$. A discussion similar to that of the last paragraph applies here as well.

In  this  note,  we  derive  the  distributions  of  U(k),  U+(k),  V(k),  V+(k) 
and  the  special  cases. 

2.  Auxiliary  Results 

For completeness, we present below the combinatorial result on multidimensional lattice paths in Handa and Mohanty (1979), for which some definitions and notations are needed. Consider $(k+1)$-dimensional lattice paths from the origin to the point $(n_0, n_1, \ldots, n_k)$. By the $\mathbf{r}$ ($= (r_1, \ldots, r_k)$)th level we mean the set of points $\{(x_0, n_1 - r_1, \ldots, n_k - r_k) : 0 \le x_0 \le n_0\}$. Denote by $a(\mathbf{r})$ and $b(\mathbf{r})$ the upper and lower restrictions at the $\mathbf{r}$th level, by which we mean the path at the $\mathbf{r}$th level can pass only through points in the set

$$\{(x_0, n_1 - r_1, \ldots, n_k - r_k) : 0 \le b(\mathbf{r}) \le n_0 - x_0 \le a(\mathbf{r}) \le n_0\} .$$

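The sets

$$A(\mathbf{n}) = \{a(\mathbf{r}) : \mathbf{0} \le \mathbf{r} \le \mathbf{n}\}$$

and

$$B(\mathbf{n}) = \{b(\mathbf{r}) : \mathbf{0} \le \mathbf{r} \le \mathbf{n}\}$$

are respectively called the upper and lower restrictions on the path. The order relation $\mathbf{x} \le \mathbf{y}$ means $x_i \le y_i$ for each $i$. Note that $a(\mathbf{r})$ and $b(\mathbf{r})$ are non-negative integers and nondecreasing in each coordinate. Also if $a(\mathbf{r}) < b(\mathbf{r})$, the path cannot pass through that level.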

Let $\mathbf{x} \,\sigma\, \mathbf{y}$ mean $x_i < y_i$ for at least one $i$. For example, the lexicographic ordering $(\mathbf{u}_1, \ldots, \mathbf{u}_d)$ of the set $\{\mathbf{r} : \mathbf{0} \le \mathbf{r} \le \mathbf{n}\}$ such that $d = \prod_{i=1}^{k} (n_i + 1)$, $\mathbf{u}_1 = \mathbf{0}$ and $\mathbf{u}_d = \mathbf{n}$ is a $\sigma$ ordering in the sense that $\mathbf{u}_1 \,\sigma\, \cdots \,\sigma\, \mathbf{u}_d$. For instance, the sequence ((0,0), (0,1), (1,0), (1,1), (2,0), (2,1)) is a lexicographic ordering of vectors $\{\mathbf{r} : (0,0) \le (r_1, r_2) \le (2,1)\}$.
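The lexicographic ordering in the example can be generated mechanically. The sketch below uses Python's `itertools.product`, which yields tuples in lexicographic order; the function names are illustrative only. The second helper checks the "strictly smaller in at least one coordinate" relation between consecutive vectors, which is the property the ordering is required to have.

```python
from itertools import product


def lexicographic_levels(n):
    """List all integer vectors r with 0 <= r <= n (componentwise), lexicographically."""
    return list(product(*(range(ni + 1) for ni in n)))


def is_admissible_ordering(levels):
    """Check that each vector is strictly smaller than its successor in some coordinate."""
    return all(
        any(x < y for x, y in zip(u, v))
        for u, v in zip(levels, levels[1:])
    )


if __name__ == "__main__":
    levels = lexicographic_levels((2, 1))
    print(levels)                          # d = (2 + 1) * (1 + 1) = 6 vectors
    print(is_admissible_ordering(levels))
```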

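Denote by $g_k(A(\mathbf{n})|B(\mathbf{n}))$ the number of paths with upper restriction $A(\mathbf{n})$ and lower restriction $B(\mathbf{n})$. Let $\mathbf{n}! = \prod_{i=1}^{k} n_i!$, $\langle \mathbf{n} \rangle = \sum_{i=1}^{k} n_i$, and let $\binom{x}{\mathbf{n}}_+$ denote $x(x-1)\cdots(x - \langle \mathbf{n} \rangle + 1)/\mathbf{n}!$ when $x \ge 0$ and every $n_i \ge 0$, and $0$ otherwise.

Proposition 1.

$g_k(A(\mathbf{n})|B(\mathbf{n}))$ satisfies the recurrence relation

$$\sum_{\mathbf{0} \le \mathbf{r} \le \mathbf{n}} (-1)^{\langle \mathbf{n} - \mathbf{r} \rangle} \binom{a(\mathbf{r}) - b(\mathbf{n}) + 1}{\mathbf{n} - \mathbf{r}}_+ \, g_k(A(\mathbf{r})|B(\mathbf{r})) = \delta_{\mathbf{0}\mathbf{n}} \tag{1}$$

where $\delta_{\mathbf{0}\mathbf{n}} = 1$ when $\mathbf{n} = \mathbf{0}$, and $= 0$ otherwise.

An explicit solution of (1) is the following:

$$g_k(A(\mathbf{n})|B(\mathbf{n})) = (-1)^{d - \langle \mathbf{n} \rangle - 1} \left\| \binom{a(\mathbf{u}_i) - b(\mathbf{u}_{j+1}) + 1}{\mathbf{u}_{j+1} - \mathbf{u}_i}_+ \right\|_{(d-1) \times (d-1)} \tag{2}$$

where $d = \prod_{i=1}^{k} (n_i + 1)$, $\| c_{ij} \|_{n \times n}$ is the $n \times n$ determinant with $(i,j)$th element $c_{ij}$, and $\{\mathbf{u}_1, \ldots, \mathbf{u}_d\} = \{\mathbf{r} : \mathbf{0} \le \mathbf{r} \le \mathbf{n}\}$ such that $\mathbf{0} = \mathbf{u}_1 \,\sigma\, \cdots \,\sigma\, \mathbf{u}_d = \mathbf{n}$.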
The explicit expression (2) may be obtained by first applying Cramer's rule to the system (1) of linear equations and then simplifying it. For practical purposes, the lexicographic ordering of vectors is good enough.
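The recurrence can be solved level by level in lexicographic order. The Python sketch below is an illustration under stated assumptions, not the authors' implementation. It reads recurrence (1) as a sum over levels $\mathbf{r} \le \mathbf{n}$ of $(-1)^{\langle \mathbf{n}-\mathbf{r} \rangle}\, C(a(\mathbf{r}) - b(\mathbf{n}) + 1;\, \mathbf{n}-\mathbf{r})\, g(\mathbf{r})$ with $C(x; \mathbf{s}) = x(x-1)\cdots(x - \langle \mathbf{s} \rangle + 1)/\mathbf{s}!$, taken as $0$ for $x < 0$; this reading is an assumption about the printed formula. The function names, and the convention that `a` and `b` are callables on level tuples bounding the first coordinate at that level, are hypothetical. A direct count (`count_directly`) is included as an independent check, assuming $b(\mathbf{0}) = 0$ and $a(\mathbf{n}) = n_0$.

```python
from functools import lru_cache
from itertools import product
from math import comb, factorial


def coeff(x, s):
    """x(x-1)...(x-|s|+1) / prod(s_i!), taken as 0 when x < 0 (assumed convention)."""
    if x < 0:
        return 0
    total = sum(s)
    multinomial = factorial(total)
    for si in s:
        multinomial //= factorial(si)
    return comb(x, total) * multinomial  # comb is 0 when total > x


def count_by_recurrence(n, a, b):
    """Solve the recurrence for every level <= n; return the value at level n."""
    g = {}
    for level in product(*(range(ni + 1) for ni in n)):
        acc = 0
        for r in product(*(range(li + 1) for li in level)):
            if r == level:
                continue  # the r = level term has coefficient 1 and is solved for
            s = tuple(li - ri for li, ri in zip(level, r))
            sign = -1 if sum(s) % 2 else 1
            acc += sign * coeff(a(r) - b(level) + 1, s) * g[r]
        g[level] = (1 if sum(level) == 0 else 0) - acc
    return g[tuple(n)]


def count_directly(n0, n, a, b):
    """Count lattice paths from (0, 0) to (n0, n) with b(r) <= y <= a(r) at every point (y, r)."""
    n = tuple(n)
    k = len(n)

    @lru_cache(maxsize=None)
    def ways(y, r):
        if not (b(r) <= y <= a(r)):
            return 0
        if y == n0 and r == n:
            return 1
        total = ways(y + 1, r) if y < n0 else 0
        for i in range(k):
            if r[i] < n[i]:
                total += ways(y, r[:i] + (r[i] + 1,) + r[i + 1:])
        return total

    return ways(0, (0,) * k)


if __name__ == "__main__":
    upper = {(0, 0): 0, (1, 0): 0, (0, 1): 1, (1, 1): 1}
    a = lambda r: upper[r]
    b = lambda r: 0
    print(count_by_recurrence((1, 1), a, b), count_directly(1, (1, 1), a, b))
```

The recurrence touches only the $d$ levels, whereas the direct count visits every lattice point, which is the practical reason the recurrence is attractive when $n_0$ is large.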

It is easily seen that if we take $(n_0, n_1, \ldots, n_k)$ as the origin and reverse the steps in the path, the upper and lower restrictions respectively become

$$A'(\mathbf{n}) = \{n_0 - b(\mathbf{n} - \mathbf{r}) : \mathbf{0} \le \mathbf{r} \le \mathbf{n}\}$$

and

$$B'(\mathbf{n}) = \{n_0 - a(\mathbf{n} - \mathbf{r}) : \mathbf{0} \le \mathbf{r} \le \mathbf{n}\} .$$

In other words, the upper and lower restrictions $a'(\mathbf{r})$ and $b'(\mathbf{r})$ are such that any path can only pass through points $\{(x_0, r_1, \ldots, r_k) : 0 \le b'(\mathbf{r}) \le x_0 \le a'(\mathbf{r}) \le n_0\}$ for every $\mathbf{r}$ in the original coordinate system. Then we get an alternative expression for the same number of paths as $g_k(A'(\mathbf{n})|B'(\mathbf{n}))$. It may be noted that for computational purposes, recurrence relation (1) is more often useful than explicit expression (2), although the determinant contains a lot of zeros.

Next we formulate a continuous analogue of Proposition 1 which gives a generalization of Steck's result (1971). Though Steck has stated the result in terms of order statistics, its connection with paths is explained in Mohanty (1979), chapters 2 and 4. In $(k+1)$ dimensions with axes $x_0, x_1, \ldots, x_k$, consider paths (not necessarily lattice paths) from the origin to $(n_0, n_1, \ldots, n_k)$ where $n_0$ is a non-negative real number and $n_1, \ldots, n_k$ are non-negative integers. In this case, a path is like a lattice path except that the number of units moved at any time on the $x_0$-axis is a non-negative real number. As before, we may define the level $\mathbf{r}$, the upper restriction $A(\mathbf{n})$ and the lower restriction $B(\mathbf{n})$. Here $a(\mathbf{r})$ and $b(\mathbf{r})$ are non-negative real numbers. In the next assertion, we adopt some of the earlier notations.

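Proposition 2.

Let $g_k(A(\mathbf{n})|B(\mathbf{n}))$ be the measure of the set of paths with upper restriction $A(\mathbf{n})$ and lower restriction $B(\mathbf{n})$. Then $g_k(A(\mathbf{n})|B(\mathbf{n}))$ satisfies the recurrence relation

$$\sum_{\mathbf{0} \le \mathbf{r} \le \mathbf{n}} (-1)^{\langle \mathbf{n} - \mathbf{r} \rangle} \, \frac{(a(\mathbf{r}) - b(\mathbf{n}))_+^{\langle \mathbf{n} - \mathbf{r} \rangle}}{(\mathbf{n} - \mathbf{r})!} \, g_k(A(\mathbf{r})|B(\mathbf{r})) = \delta_{\mathbf{0}\mathbf{n}} \tag{3}$$

where $(x)_+ = \max(0, x)$ and $\langle \mathbf{n} \rangle = n_1 + \cdots + n_k$. An explicit solution of (3) is given by

$$g_k(A(\mathbf{n})|B(\mathbf{n})) = (-1)^{d - \langle \mathbf{n} \rangle - 1} \left\| \frac{(a(\mathbf{u}_i) - b(\mathbf{u}_{j+1}))_+^{\langle \mathbf{u}_{j+1} - \mathbf{u}_i \rangle}}{(\mathbf{u}_{j+1} - \mathbf{u}_i)!} \right\|_{(d-1) \times (d-1)} \tag{4}$$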


The proof is inductive, follows the same lines as that of Proposition 1, and is therefore omitted. The remarks following Proposition 1 are also true here. In our application we will use the alternative expressions in both cases. Without ambiguity, instead of $A'(\mathbf{n})$ and $B'(\mathbf{n})$ we may use $A(\mathbf{n})$ and $B(\mathbf{n})$ with corresponding $a(\mathbf{r})$ and $b(\mathbf{r})$.
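A determinant of the form (4) can be evaluated exactly with rational arithmetic. The Python sketch below is an illustration under stated assumptions rather than the authors' procedure. It takes the $(i,j)$th entry to be $(a(\mathbf{u}_i) - b(\mathbf{u}_{j+1}))_+^{\langle \mathbf{s} \rangle}/\mathbf{s}!$ with $\mathbf{s} = \mathbf{u}_{j+1} - \mathbf{u}_i$, set to $0$ when some component of $\mathbf{s}$ is negative and to $1$ when $\mathbf{s} = \mathbf{0}$; both conventions are assumptions. The names `det`, `path_measure` and `ks_one_sample_cdf` are hypothetical. For $k = 1$ the check uses the classical one-sample bounds $a(r) = \min(1, r/n + c)$ and $b(r) = \max(0, r/n - c)$, for which $n!$ times the determinant is Steck's formula for $P(\sup_x |F_n^*(x) - x| \le c)$.

```python
from fractions import Fraction
from itertools import product
from math import factorial


def det(matrix):
    """Exact determinant by Gaussian elimination over Fractions."""
    m = [row[:] for row in matrix]
    size = len(m)
    result = Fraction(1)
    for col in range(size):
        pivot = next((r for r in range(col, size) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            result = -result
        result *= m[col][col]
        for r in range(col + 1, size):
            factor = m[r][col] / m[col][col]
            for c in range(col, size):
                m[r][c] -= factor * m[col][c]
    return result


def path_measure(n, a, b):
    """Evaluate the (d-1) x (d-1) determinant with levels in lexicographic order."""
    levels = list(product(*(range(ni + 1) for ni in n)))
    d = len(levels)
    matrix = []
    for i in range(d - 1):
        row = []
        for j in range(d - 1):
            s = [v - u for u, v in zip(levels[i], levels[j + 1])]
            if min(s) < 0:
                row.append(Fraction(0))  # assumed convention for negative components
                continue
            width = max(Fraction(0), Fraction(a(levels[i])) - Fraction(b(levels[j + 1])))
            denom = 1
            for si in s:
                denom *= factorial(si)
            row.append(width ** sum(s) / denom)
        matrix.append(row)
    sign = -1 if (d - sum(n) - 1) % 2 else 1
    return sign * det(matrix)


def ks_one_sample_cdf(n, c):
    """P(sup_x |F_n*(x) - x| <= c) for a uniform sample of size n (case k = 1)."""
    c = Fraction(c)
    a = lambda r: min(Fraction(1), Fraction(r[0], n) + c)
    b = lambda r: max(Fraction(0), Fraction(r[0], n) - c)
    return factorial(n) * path_measure((n,), a, b)


if __name__ == "__main__":
    print(ks_one_sample_cdf(2, Fraction(3, 8)))
```

For $n = 2$ and $c = 3/8$ the two order statistics must fall in $[1/8, 3/8]$ and $[5/8, 7/8]$, so the probability is $2! \cdot (1/4)^2 = 1/8$, which is what the determinant gives.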


3.  Distributions 

As in the two-sample case, we pool all samples and order the observations from the smallest to the largest. Let $Z_j = i - 1$ if the $j$th member ($j = 1, \ldots, n_1 + \cdots + n_k$) in the ordered sequence is $X_{il}$, $l = 1, \ldots, n_i$, $i = 1, \ldots, k$. It can be proved that

$$P(Z_1 = z_1, \ldots, Z_{n_1 + \cdots + n_k} = z_{n_1 + \cdots + n_k}) = \binom{n_1 + \cdots + n_k}{n_1, \ldots, n_k}^{-1} \tag{5}$$

in the same way as in the case of $k = 2$. Represent the sequence of $Z$'s as a $k$-dimensional lattice path from the origin to $(n_1, \ldots, n_k)$ by putting a unit on the $x_i$ axis whenever $Z_j = i$, $i = 0, \ldots, k-1$, $j = 1, \ldots, n_1 + \cdots + n_k$. Because of (5) each path is equally likely. Also, by virtue of their definitions, statistics $U(k)$ and $U^+(k)$ can be conveniently represented as characteristics of these lattice paths. It can be verified without much difficulty that for $c > 0$,


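$$P(U(k) \le c) = M \Big/ \binom{n_1 + \cdots + n_k}{n_1, \ldots, n_k}$$

and for any $c$,

$$P(U^+(k) \le c) = N \Big/ \binom{n_1 + \cdots + n_k}{n_1, \ldots, n_k}$$

where $M$ is the number of paths such that a point $(x_0, x_1, \ldots, x_{k-1})$ on any path satisfies the inequalities

$$\Bigl| \sum_{j=1}^{k} c_{ij}\, x_{j-1}/n_j \Bigr| \le c , \quad i = 1, \ldots, m,$$

and $N$ is the number of paths satisfying the inequalities

$$\sum_{j=1}^{k} c_{ij}\, x_{j-1}/n_j \le c , \quad i = 1, \ldots, m.$$

Suppose $c_{i1} > 0$. Then, using Proposition 1 (its alternative form) it turns out that $M = g_{k-1}(A(\mathbf{n})|B(\mathbf{n}))$ with

$$a(\mathbf{r}) = \min\Bigl\{ n_1 ,\ \Bigl\lfloor \frac{n_1}{c_{i1}} \Bigl( c - \sum_{j=2}^{k} c_{ij} \frac{r_{j-1}}{n_j} \Bigr) \Bigr\rfloor ,\ i = 1, \ldots, m \Bigr\} ,$$

$$b(\mathbf{r}) = \max\Bigl\{ 0 ,\ \Bigl\lceil \frac{n_1}{c_{i1}} \Bigl( -c - \sum_{j=2}^{k} c_{ij} \frac{r_{j-1}}{n_j} \Bigr) \Bigr\rceil ,\ i = 1, \ldots, m \Bigr\} ,$$

where $\lfloor a \rfloor$ is the largest integer $\le a$ and $\lceil a \rceil$ is the smallest integer $\ge a$, and $N = g_{k-1}(A(\mathbf{n})|B(\mathbf{n}))$ with $a(\mathbf{r})$ as above and $b(\mathbf{r}) = 0$. Also, $\{c_{ij}\}$ must be such that $a(\mathbf{r})$ and $b(\mathbf{r})$ are nondecreasing. The same requirement applies in the derivation of the distributions of $V(k)$ and $V^+(k)$. For the example of $U_1(k)$, $a(\mathbf{r})$ and $b(\mathbf{r})$ simplify to

$$a(\mathbf{r}) = \min\Bigl\{ n_1 ,\ \Bigl\lfloor \frac{n_1}{n_j} (r_{j-1} + c\, n_j) \Bigr\rfloor ,\ j = 2, \ldots, k \Bigr\} ,$$

$$b(\mathbf{r}) = \max\Bigl\{ 0 ,\ \Bigl\lceil \frac{n_1}{n_j} (r_{j-1} - c\, n_j) \Bigr\rceil ,\ j = 2, \ldots, k \Bigr\} .$$

When $c_{i1} < 0$, the modification is obvious and is therefore omitted.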
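For small samples the probability above can be checked by counting the admissible lattice paths directly. The Python sketch below does this for the special case $\max_i \sup_x |F_1^*(x) - F_i^*(x)|$. It uses a plain dynamic programme over lattice points rather than Proposition 1, so it is an independent check and not the paper's method; the function name `prob_u1_at_most` is hypothetical. It relies only on the facts stated in the text: every path is equally likely under homogeneity, and the statistic is at most $c$ exactly when every point $(x_0, \ldots, x_{k-1})$ on the path satisfies $|x_0/n_1 - x_{j-1}/n_j| \le c$ for $j = 2, \ldots, k$.

```python
from fractions import Fraction
from functools import lru_cache
from math import factorial


def prob_u1_at_most(n, c):
    """P(max_i sup_x |F_1* - F_i*| <= c) under homogeneity, sample sizes n = (n_1, ..., n_k)."""
    n = tuple(n)
    c = Fraction(c)
    k = len(n)

    def allowed(x):
        # |x_0/n_1 - x_{j-1}/n_j| <= c for j = 2, ..., k
        first = Fraction(x[0], n[0])
        return all(abs(first - Fraction(x[j], n[j])) <= c for j in range(1, k))

    @lru_cache(maxsize=None)
    def paths_from(x):
        """Number of admissible paths from lattice point x to the end point n."""
        if not allowed(x):
            return 0
        if x == n:
            return 1
        return sum(
            paths_from(x[:i] + (x[i] + 1,) + x[i + 1:])
            for i in range(k)
            if x[i] < n[i]
        )

    total = factorial(sum(n))  # multinomial coefficient: number of all paths
    for ni in n:
        total //= factorial(ni)
    return Fraction(paths_from((0,) * k), total)


if __name__ == "__main__":
    print(prob_u1_at_most((2, 2), Fraction(1, 2)))
```

With two samples of size 2 and $c = 1/2$, four of the six equally likely arrangements keep the two empirical distribution functions within $1/2$ of each other, so the probability is $2/3$.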

Next let us consider the distributions of $V(k)$ and $V^+(k)$. Assume $c_{i,k+1} > 0$. Under the null hypothesis it can be shown by using Proposition 2 that

$$P(V(k) \le c) = \mathbf{n}!\; g_k(A(\mathbf{n})|B(\mathbf{n}))$$

where

I  i  !  k  r.  \ 

.(r)  -  “in' I,  j—  (c  -  l  c  -1  -  c  —  ,  .1*1 . k;  1  -  1 . «l 

\  i»k+l  v  j=l  j  *  /  / 

\  M  i 


—  \  c.  .  .  '  ij  n.  ik  n.  .  ' 

\  i»k+l  v  j=l  j  *  / 

\  W 

b(r)  •  max!  0,  — - f-c  -  l  c..  ,  i  =  l,...,m  j  . 

\  ci,k+l  '  j-1  ny  J 


Notice the change in $a(\mathbf{r})$, which is consistent with the well known expression when $k = 1$. The underlying argument is an extended version of the one-sample case. For $V^+(k)$, the expression is the same except that $b(\mathbf{r}) = 0$. In the particular case of $V_1(k)$, $a(\mathbf{r})$ and $b(\mathbf{r})$ become
a(r)  =  min^l,  +  c,  j  =  1, . . .  ,kj 


b(r)  =  max^O,  -  c,  j  =  . 


The expression for $V_1^+(k)$ is obtained from that of $V_1(k)$
The Exact Distributions of Some Kolmogorov-Smirnov Type k-sample Statistics

Let $X_{i1}, \ldots, X_{in_i}$, $i = 1, \ldots, k$, be $k$ independent random samples from populations with distribution functions $F_1, \ldots, F_k$ respectively. For the hypothesis of homogeneity $H_0 : F_1 = \cdots = F_k$, we obtain the distributions of two Kolmogorov-Smirnov type statistics $\max_i \sup_x \bigl| \sum_{j=1}^{k} c_{ij} F_j^*(x) \bigr|$ and $\max_i \sup_x \bigl( \sum_{j=1}^{k} c_{ij} F_j^*(x) \bigr)$, where $\{c_{ij}\}$ is a set of constants with $c_{i1} \neq 0$ and $F_j^*$ is the empirical distribution function of the $j$th sample. Also, for the goodness-of-fit hypothesis similar statistics are considered and their distributions derived. Some special cases are discussed.


